[57] ABSTRACT

A storage system to be connected to a large-scale computer
includes a plurality of first logical units connected to a host
device, a plurality of second logical units connected to a
storage device, a plurality of cache memories, and a common
bus wired between these logical units and memories. The
plurality of first logical units, the plurality of second logical
units, and the plurality of cache memories are all made in the
form of modules. The modules are detachably mounted to the
common bus disposed on a back plane. The storage device can
be made up of a plurality of small-size storage units arranged
in an array. Thus, the storage system realizes scalability.
Since the plurality of first logical units, the plurality of
second logical units, and the plurality of cache memories are
duplexed and the common bus is made in the form of 2 channels,
the storage system can perform degraded operation. Since the
plurality of first logical units, the plurality of second logical
units, and the plurality of cache memories allow hot
replacement, the storage system can realize non-stop
maintenance.

27 Claims, 17 Drawing Sheets 
STORAGE SYSTEM REALIZING
SCALABILITY AND FAULT TOLERANCE

This application is a continuation of application Ser. No. 
08/267,013, filed on Jun. 21, 1994, now abandoned. 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates to storage systems which 
include a storage controller for controlling storage devices 
such as a magnetic tape device, a semiconductor storage 
device, or an optical disk device connected to a large-scale 
computer, a network system or the like. More particularly, 
the present invention relates to a storage system which is
highly extendable and which allows degraded operation and
hot replacement.

2. Description of the Related Art 

A conventional storage system connected to a large-scale
computer is disclosed in, for example, JP-B-61-43742
(corresponding to U.S. Pat. No. 4,636,946). In the system
disclosed therein, interfaces (host adaptors) with a higher-rank
or host device (computer) (CPU), a cache memory, and
interfaces (disk adaptors) with a storage device such as a
magnetic disk device are interconnected by hot lines
(dedicated lines).

FIG. 20 shows a schematic arrangement of a conventional
storage system. In the drawing, reference numerals 201-1 to
201-n denote host adaptors (logical modules connected to a
host computer) connected respectively to a plurality of host
computers (CPUs), 202-1 to 202-n denote disk adaptors
(logical modules connected to a storage device) connected to
a shared large-scale disk device 205, 203 denotes a cache
memory shared with the plurality of host adaptors, and 206
denotes a shared management memory. In a conventional
storage system, hot lines 207-1 to 207-n and 208-1 to 208-n
are connected between the host adaptors 201-1 to 201-n and
cache memory 203, between the cache memory 203 and disk
adaptors 202-1 to 202-n, between the host adaptors 201-1 to
201-n and management memory 206, and between the
management memory 206 and disk adaptors 202-1 to 202-n,
respectively. Further, maintenance processors (SVPs, not
shown), which carry out monitoring and maintenance
operations over these host adaptors and disk adaptors, are
connected to the respective host and disk adaptors through
respective dedicated lines.

In the prior art system described above, since the hot lines
are wired between the host adaptors (logical modules
connected to the host device), the disk adaptors (logical
modules connected to the storage device), and the cache
memory (cache memory module), the system configuration
becomes more complicated, and the host adaptors, cache
memory, disk adaptors and disk device are poorer in
extendability, which makes it impossible to realize a so-called
scalable (extendable and reducible) system configuration.
Further, in prior art systems, no consideration is paid to the
fact that multiplexing of the system enables degraded
operation (one of the multiplexed systems is stopped and the
other alone is operated) at the time of failure occurrence, or
hot replacement (a substrate or a circuit part is inserted for
its exchange while the system is being operated). For this
reason, prior art systems such as the one described above have
a problem in that, when it is desired to exchange or replace
parts at the time a system failure occurs, or when it is time
to upgrade a system control program, the entire system must be
temporarily stopped for that purpose.
SUMMARY OF THE INVENTION

Therefore, it is an objective of the present invention to
provide a storage system which solves the above problem in
the prior art by employing a common bus system in such a
manner that logical modules such as host adaptors and
memory adaptors, a cache memory, and a storage medium
can be connected according to the system configuration
(scale or size) to thereby realize a scalable system. A further
objective of the present invention is to provide a storage
system which can realize degraded operation by multiplexing
respective logical modules, a storage medium and a
common bus, and also which can realize hot replacement of
the respective logical modules and storage medium to
thereby allow system maintenance without halt of the system.

In accordance with one preferred embodiment of the
present invention, there is provided a storage system which
comprises a plurality of logical units connected to a host
device to form interfaces with the host device, a storage unit,
a plurality of logical units connected to the storage unit to
form interfaces with the storage unit, and cache memories
(shared with the plurality of logical units connected to the
host device and with the plurality of logical units connected
to the storage unit) for temporarily storing therein data to be
transferred between these devices and units. The plurality of
logical units connected to the host device, the plurality of
logical units connected to the storage unit, and the cache
memories are interconnected by a common bus which is
shared with these devices and units. As a result, there can be
obtained a scalable system which realizes extension and
modification of the plurality of logical units connected to the
host device, the plurality of logical units connected to the
memories, and the cache memory merely by adding or
modifying these on the common bus, and which makes it easy
to upgrade the system through such extension.

Since the plurality of logical units connected to the host
device, the plurality of logical units connected to the
memories, and the cache memories are arranged in a
duplexed form and the common bus is wired between these
logical units and memories as divided into 2 channels, even
when one of these units becomes faulty, the other unit can
be used to perform degraded operation. In this connection,
information indicative of the degraded operation status at the
time of a failure occurrence is written in the shared memory.
In this case, since each of the plurality of logical units
connected to the host device, the plurality of logical units
connected to the storage unit, and the cache memories is
provided with a connector which allows for hot replacement,
the system allows maintenance and inspection to occur for
exchange of faulty parts and also allows for the addition of
parts to facilitate system extension, all without having to halt
the system while in operation.
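
As an illustration only, the duplexing, degraded operation, and hot replacement summarized above can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the patented implementation: the names `DuplexedPair`, `SharedMemory`, `report_failure`, and `hot_replace` are hypothetical, and the shared memory is modeled as a plain dictionary that records which unit of a pair has been taken out of service.

```python
from dataclasses import dataclass, field


@dataclass
class DuplexedPair:
    """Two interchangeable units of one kind (hypothetical model)."""
    name: str
    healthy: list = field(default_factory=lambda: [True, True])

    def active_units(self):
        # Indices of the units that can currently serve requests.
        return [i for i, ok in enumerate(self.healthy) if ok]


class SharedMemory:
    """Stand-in for the shared memory holding degraded-operation status."""
    def __init__(self):
        self.degraded = {}  # pair name -> index of the faulty unit


def report_failure(pair, unit, shared):
    """Mark one unit faulty and record the degraded status."""
    pair.healthy[unit] = False
    if not pair.active_units():
        raise RuntimeError(f"{pair.name}: both units failed")
    shared.degraded[pair.name] = unit


def hot_replace(pair, unit, shared):
    """Swap in a good unit while running and clear the degraded status."""
    pair.healthy[unit] = True
    shared.degraded.pop(pair.name, None)
```

In this sketch a failure of one unit leaves the other unit as the only active one, and replacing the faulty unit restores both without any "stop the system" step, which mirrors the behavior the summary describes.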

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 conceptually shows a schematic arrangement of
a storage system in accordance with an embodiment of the
present invention;

FIG. 2 is a detailed arrangement of the storage system of
the embodiment of the present invention;

FIG. 3 is a diagram for explaining data flow and data
format in the arrangement of FIG. 2;

FIG. 4 is an appearance view of the embodiment of the
present invention;

FIG. 5A is a front view of a control unit in the system of
the embodiment of the present invention, showing an
example of how the control unit is mounted;
FIG. 5B is a side view of the control unit in the system of
the embodiment of the present invention;

FIG. 6A is a front view of an array disk unit in the system
of the embodiment of the present invention, showing an
example of how the control unit is mounted;

FIG. 6B is a side view of the array disk unit in the system
of the embodiment of the present invention;

FIG. 7 is a wiring diagram of a logical unit frame part in
the system of the embodiment of the present invention;

FIG. 8 is an exploded perspective view of the logical unit
frame part in the system of the embodiment of the present
invention, showing how the logical unit frame part is
mounted;

FIG. 9 is a configuration of software applied to the
embodiment of the present invention;

FIG. 10 is a diagram for explaining data flows and shared
software functions in the embodiment of the present
invention;

FIG. 11A is a diagram for explaining a duplex common
bus in the embodiment of the present invention;

FIG. 11B is a diagram for explaining degraded operation
in the embodiment of the present invention;

FIG. 12 is a diagram for explaining the duplex and
degraded operation of each of parts in the system of the
embodiment of the present invention;

FIG. 13 is a diagram for explaining multiplex and
degraded operation of a power supply system in the system
of the embodiment of the present invention;

FIG. 14 shows a structure of a single magnetic disk device
used in an array disk;

FIG. 15 shows the storage capacity of the magnetic disk
unit and the system performance of the array disk;

FIG. 16 is a structure of a small-size disk array provided
with a high-performance, large-capacity cache memory;

FIG. 17 is a structure of a large-size disk array provided
with a high-performance, large-capacity cache memory;

FIG. 18 is a structure of a high-performance fault tolerant
server system;

FIG. 19 is an arrangement of an inexpensive server
system; and

FIG. 20 is a schematic arrangement of a prior art storage
system.

DESCRIPTION OF THE PREFERRED
EMBODIMENTS

An embodiment of the present invention will be explained
with reference to FIGS. 1 to 18.

FIG. 1 shows a conceptual diagram of a system of the
present invention. The present embodiment will be briefly
explained by referring to FIG. 1.

In the drawing, reference numeral 1 denotes a host
adaptor as a logical module connected to a host CPU (host
computer), 2 is a disk adaptor as a logical module connected
to a storage medium, 3 is a cache memory package (cache
memory modules) for temporarily storing therein data to be
transferred between both modules, 4 is a common bus for
controlling data transfer between the host adaptor 1, disk
adaptor 2 and cache memory package 3, and 5 is a group of
magnetic disks (which will be hereinafter referred to
as "array disk") as a storage medium vertically and
horizontally arranged in an array. The host adaptor 1 has a means
for converting the data format and address format of the host
interface side into data and address formats suitable for the
storage medium interface, and a duplexed microprocessor for
performing control and management over the means. The
disk adaptor 2 has an address operation function for storage
of data in the storage medium, a function of creating
redundant data for ensuring memory data, a function of
recognizing information on the structure of the storage
medium, and a duplexed microprocessor for performing
control and management over these functions.

In FIG. 1, the host adaptor writes write data received from
the host device (CPU) and information on the management
of the write data once in the cache memory package 3 via the
common bus 4, and after completing the writing operation,
reports the completion of the writing operation to the host
device. In an idle time after that, the disk adaptor 2 reads
out the data from the cache memory package 3 on the
basis of the management information of the cache memory
package 3.

The host adaptor 1, when receiving a data read request
from the host device with the requested data present
on the cache memory package 3, does not perform a
reading operation from the array disk 5 but transmits the
data on the cache memory package 3 to the host device. On
the other hand, when the data is not present on the cache
memory package 3, the disk adaptor 2 writes the data and
management information thereof on the cache memory
package 3 from the array disk 5 through the common bus 4.
The host adaptor 1 refers to the management information,
reads out the data from the cache memory package, and
transmits it to the host device.

The number of the host adaptors 1, the number of the disk
adaptors 2 and the number of the cache memories in the
cache memory package 3 on the common bus 4 can each be
arbitrarily changed. When the mounting number of the host
adaptors 1 is changed, the number of buses connected to the
host devices is also changed, so that the data transmission
capability to the host device can be enhanced. When the
mounting number of the disk adaptors 2 is changed, the
number of buses connected to the storage medium is also
changed, so that the data writing/reading transmission
capability to the storage medium can be enhanced. At the
same time, the number of storage media can also be
increased. When the mounting number of cache memories in
the cache memory package 3 is changed, the capacity of the
cache memory package as a temporary data storage is also
changed, so that the ratio of the capacity of the cache memory
package to the total capacity of the storage medium can be
increased. As a result, there can be realized a scalable system
arrangement which can increase the probability (which will be
referred to as "cache hit rate", hereinafter) that data accessed
by the host device is present on the cache memories.

FIG. 2 is a detailed arrangement of the conceptual diagram
of FIG. 1. In FIG. 2, only one of the plurality of host
adaptors 1 and only one of the plurality of disk adaptors 2
are illustrated and the other adaptors are omitted.

The host adaptor 1 includes a signal converter 6 for
converting an optical signal of the host interface into an
electric signal, a format converter 7 for converting the
format of data of the host device into a format suitable for
the array disk 5, and a data transmission controller 8 for
controlling data transfer to the common bus 4 and having a
storage buffer therein for storage of a packet transmission
unit of data. The host adaptor 1 also includes a bus driver 9
of a small-amplitude current drive type (which driver will be
referred to as "the BTL", hereinafter) allowing hot
replacement. Note that "BTL" is a registered trademark of National
Semiconductor Ltd.
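
The FIG. 1 data flow described above (write into the cache, report completion, let the disk adaptor handle the data later, and serve reads from the cache when possible) can be illustrated with a small Python sketch. It is a toy model under assumptions, not the embodiment itself: `CachePackage`, `ArrayDisk`, `host_write`, `destage`, and `host_read` are hypothetical names, the management information is reduced to a set of pending addresses, and the step in which the disk adaptor stores the data it reads from the cache onto the array disk is an assumption about what that read is for.

```python
class CachePackage:
    """Toy model of cache memory package 3: data plus management info."""
    def __init__(self):
        self.data = {}        # address -> payload held in the cache
        self.pending = set()  # addresses written but not yet on disk


class ArrayDisk:
    """Toy model of array disk 5."""
    def __init__(self):
        self.blocks = {}


def host_write(cache, addr, payload):
    """Host adaptor: write once into the cache, then report completion."""
    cache.data[addr] = payload
    cache.pending.add(addr)
    return "complete"


def destage(cache, disk):
    """Disk adaptor, in idle time: move pending data to the array disk."""
    for addr in sorted(cache.pending):
        disk.blocks[addr] = cache.data[addr]
    cache.pending.clear()


def host_read(cache, disk, addr):
    """Return (payload, hit). On a miss the disk adaptor stages the data."""
    if addr in cache.data:
        return cache.data[addr], True
    cache.data[addr] = disk.blocks[addr]  # staged via the common bus
    return cache.data[addr], False
```

The host sees "complete" as soon as the cache holds the data, which is why a larger cache relative to the storage medium raises the cache hit rate discussed above.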
A data transmission request from the host computer is sent
to a microprocessor 10 (which will be referred to merely as
"the MP 10", hereinafter), and data transmission control
within the host adaptor 1 is put under control of the MP 10.

In order to secure high reliability in detecting a failure
occurrence of the MP 10, the host adaptor 1 has,
in addition to the MP 10, a second microprocessor MP 10'
similar to the MP 10. A checker 11 performs comparison
between the MP 10 and MP 10'.

A boot device 12 for storing therein a program for control
of the MP 10 employs a rewritable large-capacity flash
memory. The MP 10 produces a copy of the control program
on a local memory 13 as necessary, whereby a memory
access time to the MP 10 can be made short or an access
speed can be made high. A part 29 enclosed by a broken line
in FIG. 2 refers to a channel adaptor module. The host
adaptor 1 has such modules 29 corresponding to two
circuits.

The disk adaptor 2 includes a buffer memory 14 for
storing therein sector units of data to be written in the array
disk 5, a data control buffer 15 for controlling the buffer
memory 14 and data transmission control, a redundant data
generator 16 for generating redundant data for ensuring the
data to be written in the array disk 5, and an initiator (SCSI
master side interface) 17 to the array disk 5 (target).

Data transmission control within the disk adaptor 2 is
carried out under control of an MP peripheral (including the
MP 10, MP 10', checker 11, boot device 12 and local
memory 13 and having a control program for the disk
adaptor 2 therein) having the same structure as the host
adaptor 1.

Although only 4 disks (targets) are illustrated as the array
disks 5 in FIG. 2, the array disks are actually made up of, for
example, 4 (horizontal) x 4 (vertical) to 4 (horizontal) x 4
(vertical) of disks for the single disk adaptor 2. Each of the
horizontal rows forms an error correction circuit group
(ECC group) which is made up of, e.g., 3 data disks and a
single parity disk. Further, there are a plurality of buses
between a set of such array disks 5 and the disk adaptor,
which are connected to at least 2 or more of the disk adaptors
2, which will be described later. Also there are a plurality of
buses between the CPU and the host adaptor 1, which are
connected to at least 2 or more of the host adaptors 1. And
when a failure takes place in one of the host adaptors, an
access from the same CPU to the same array disk 5 can be
realized through the other host adaptor 1 or the other disk
adaptor 2.

The cache memory package 3 includes a shared memory
18 for storing therein various sorts of management
information and accessible commonly by the MPs 10 of the
adaptors, a shared memory controller 19, a cache memory
20, and a cache memory controller 21. Each of the
memory controllers 19 and 21 has or embeds an ECC
generation circuit for ensuring memory write data and an
inspection/correction circuit for read data. The entire cache
memory package 3 realizes a cache capacity of up to 1 GB,
and in order to provide a duplexed cache memory, 2 of the cache
memory packages 3 are mounted in the system.

When it is desired to further increase the capacity of the
cache memories, in place of the cache memory package 3 (or
in addition to the cache memory package 3), a cache port
package 22 is mounted so as to be connected to a cache unit
24 through a cable 23 for interconnection between back
planes (plates for substrate insertion). The cache memory
unit 24 has a cache memory package 24a having cache
memories 20', a cache port package 24b connected to the
inter-back-plane interconnection cable 23, and a high speed
I/O bus 24c connected to the cache memory package 24a
and cache port package 24b for performing high-speed data
transmission therethrough. The cache memories within the
additionally-provided unit 24 are arranged so as to be
accessed via the cache port package 22 and cable 23. The
provision of the cache unit 24 enables an increased cache
capacity of up to 8 GB x 2. FIG. 2 shows such a case that, in
addition to the provision of 2 of the cache memory packages
3, the cache port package 22 is mounted which is connected
to the cache unit 24 through the cable 23.

Such host adaptors 1, disk adaptors 2 and cache memory
package 3 as mentioned above are interconnected by the
common bus 4, which includes a multiprocessor bus (which will
be referred to as the M bus, hereinafter) 25 through which
the MP 10 of each adaptor accesses the shared memory and
a high-speed (fast) I/O bus (which will be referred to as the
F bus, hereinafter) 26.

The F bus 26 is operated usually on a 64-bit-width,
2-channel, parallel basis so that, when one of the 2 channels
becomes faulty, the other performs degraded operation.
Further, when the M bus 25 becomes faulty, either one of the
two channels of the F bus 26 is used as the M bus and the
remaining other channel is used as the F bus.

Further, the element BTL 9 allowing hot replacement (at the
time of inserting or removing a part, the inserting or removing
operation can be carried out with a small load of the
inserted or removed part, for which reason the part inserting
or removing operation can be done under the activated
system) is used as an interface with the common bus 4.
When the host adaptor 1 becomes faulty, the host adaptor is
closed. That is, the system closes the corresponding faulty
bus and instead uses the other normally-operating host
adaptor 1 to continue access to the array disk 5 from the
host device (the same CPU). The maintenance operator
removes the host adaptor 1 in which a failure occurred
during the operation of the system and which is put in its
closed state. Thereafter, a normal host adaptor 1 is inserted
in the system and a restoration instruction is given from a
maintenance processor (which will be referred to as the SVP,
hereinafter) 27 to a LAN 28 so that the system checks the
operation of the exchanged host adaptor 1 in such a manner
that, if the host adaptor is normal, the system restores the
closed bus to realize non-stop operation. In the drawing,
reference symbol LANC refers to a LAN Controller
(SVP interface controller). The SVP 27 is similarly
connected also to the other host adaptor 1 and the disk adaptor
2 for monitoring and maintenance.

When the control program of each adaptor is required to
be modified, the SVP 27 rewrites the contents of the control
program within the boot device 12 through the LAN 28 to
enable non-stop upgrade.

That is, when it is desired to upgrade the control program
of the system, the control program within the boot device 12
of the host adaptor 1 and disk adaptor 2 for the program
upgrade is first rewritten. After the rewriting of the control
program is completed, the adaptor having the rewritten
control program is reset to perform exchange of the system
control program.

FIG. 3 is a diagram for explaining data flow and ensured
data in the arrangement of FIG. 2.

When data is written in the array disk 5 from the host
device, information on a physical address (which will be
referred to merely as PA, hereinafter) in a memory space as
a write destination and then data (CKD (count key data)
format) plus a CRC (cyclic redundancy check) code are sent
from, e.g., ESCON (trade name of IBM Ltd.). These optical
signals are converted at the signal converter 6 into electric
signals and parity is also generated. The format converter 7
converts a data format into an FBA (fixed blocked
architecture) format and attaches an LRC (longitudinal
redundancy check) code thereto, takes the PA as part of the
data to generate a logical address (which will be referred to
as LA, hereinafter), attaches parities to all of this information,
and then sends it onto the F bus 26.

The cache package 3 attaches an error-correctable ECC to
the data received from the F bus 26 and writes it in the cache
memory 20.

The disk adaptor 2 further attaches a CRC code to the data
received from the F bus and sends the code-attached data
to the array disk 5 via the SCSI interface, and an ECC is
attached at each of the magnetic disk units to ensure the
write data.

Even upon reading data from the array disk 5, the read
data is similarly inspected and corrected on the basis of each
check code to improve its reliability.

As has been explained above, the check code is duplexed,
that is, horizontal check for every predetermined length is
carried out in the data length direction while vertical check
(for example, for every byte unit) is carried out in the data
vertical (width) direction, so that one of the double check
codes is reliably transferred as data between transfer regions
(enclosed by a chain-dotted line in the drawing) to compare
the check code transferred as the data with a check code
created from the transferred data, whereby the data is
positively ensured.

Shown in FIG. 4 is an appearance view of an apparatus for
implementing the scalability explained in connection with
FIG. 1, which comprises two units, i.e., a control unit 41 for
controlling the array disk 5 and an array unit for mounting
the array disk 5 therein.

FIGS. 5A and 5B show mounting diagrams of the control
unit 41, wherein FIG. 5A is a front view thereof and FIG. 5B
is a side view thereof, respectively. In FIG. 5B, reference
numeral 51 denotes a logical unit frame part for mounting
therein the host adaptors 1, disk adaptors 2 and cache
memory package 3, 52 a battery part for supplying power to
the cache memory as a volatile memory in case of power
failure, 53 a cache memory extension part for mounting
therein the cache unit 24 and an additional battery for the
added memory at the time of adding a cache memory, 54 an
SVP mounting part, 55 a switching power supply of the
logical unit frame for supplying power to the logical unit
part, 56 an array disk mounting part when the configuration
(capacity) of the array disk 5 is small, 57 an array disk
switching power supply for supplying power to the array
disk 5, and 58 a commercial power supply controller for
supplying power to both the switching power supplies 55 and
57.

FIGS. 6A and 6B show mounting views of the array unit
41 when it is desired to arrange a large-capacity array disk,
wherein FIG. 6A is a front view thereof and FIG. 6B is a side
view thereof.

The array disk mounting part 56 can mount therein up to
112 (8 (row) x 7 (column) x 2) magnetic disk units, and for
easy exchange of a faulty magnetic disk unit, the mounting
part 56 employs such a mounting system that the faulty and
new units can be removed and inserted from the front side
of the unit and from the rear side.

In FIG. 6B, reference numeral 61 denotes a cooling fan
for releasing heat generated in the entire unit. In order to
enhance the cooling effect and from the viewpoint of
suppressing noise, the fan 61 comprises a plurality of small
cooling fans located as distributed therein which move air
from its floor side to the ceiling side for ventilation cooling.

FIG. 7 is a wiring diagram of the logical unit frame part
explained in FIGS. 5A and 5B.

In the drawing, reference numeral 71 denotes a back plane
(plate for substrate insertion) having the common bus 4
wired in the form of a printed circuit thereon, and 72 a
connector for interconnection between each adaptor or
package and the back plane 71.

Since data transfer between the host adaptors 1, disk
adaptors 2 and cache memory package 3 is carried out
through the common bus 4, each adaptor and package can be
connected at any position on the connector 72 and the
number of the mounted host adaptors 1 and the number of
the mounted disk adaptors 2 can be freely changed.

Meanwhile, when it is desired to increase the cache
capacity, the cache memory package 3 is replaced by the
cache port package 22, or the cache port package 22 is
mounted in addition to the cache memory package 3, and is
connected to the cache unit 24 via the connection cable 23
as shown in FIG. 7. As a result, the cache memory capacity
can be increased by an amount corresponding to up to 8
GB x 2 in addition to the 2 GB capacity of the cache memory
package 3.

FIG. 8 is an exploded perspective view of the logical unit
frame part of FIGS. 5A and 5B for explaining how the
frame is wired.

In FIG. 8, the common bus 4 is wired on a back plane 71
in its horizontal direction in the form of a printed circuit, and
the mounting parts of the substrates (CP) of the cache port
packages 22, the mounting parts of substrates (C) of the
cache memory packages 3, the mounting parts of substrates
(H) of the host adaptor modules 1 and the mounting parts of
substrates (D) of the disk adaptor modules 2 are provided to
the back plane 71, so that each substrate can be attached to
and detached from its insertion/removal operating side as
shown by arrow 84, and when each substrate is inserted
in the back plane, the substrate is electrically connected
to the common bus 4.

Reference numeral 81 denotes an optical connector part
mounted in lower parts of the substrates of the host adaptors
1 to form an interface with the host device, 82 an SCSI
connector part mounted in lower parts of the substrates of
the disk adaptors 2 and connected to the array disk 5, and 83 a
connector for connection with the cable 23 between the back
planes when the cache port package 22 is mounted. Numeral
85 denotes a cache memory body (the cache memory 20 in
FIG. 2) mounted in the lower part of the substrate (C) of the
cache memory package 3.

In order to improve operability upon removal of the faulty
adaptor or package and upon insertion of a new one at the
time of a failure occurrence, the connectors except for the
connector 83 are not mounted to the operating side 84 but
concentratedly mounted on the side of the back plane 71.

FIG. 9 shows a configuration of software used in the
present invention.

Reference numeral 91 represents a channel adaptor control
program (which will be referred to as the CHP,
hereinafter) written in the boot device 12 of the host adaptor
1. Of disk adaptor control programs written in the boot
device 12 of the disk adaptor 2, numeral 92 denotes a disk
adaptor master control program (which will be referred to as
the DMP, hereinafter) for controlling the operation inherent
in the array disk and data transfer between the cache
memory 20 and the array disk 5, 93 denotes a disk adaptor
slave control program (which will be referred to as the DSP,
hereinafter) for controlling data transfer between the cache
memory 20 and the array disk 5 under control of the DMP 92.

Written in the boot device 12 of the disk adaptor 2 are
programs DMP 92 and DSP 93 having two sorts of functions
so that, when n disk adaptors access the array disk, one of the
disk adaptors is operated as the DMP 92, another is specified as
a spare for the DMP 92 (which adaptor is operated as the
DSP 93), and the remaining (n-2) disk adaptors are operated
as the DSPs 93.

Reference numeral 94 denotes an SVP control program
for the SVP 27 for monitoring and maintaining the CHP 91,
DMP 92 and DSP 93. When it is desired to renew each
control program, the SVP 27 can directly renew the control
program of the MP 10 stored in the boot device 12 of the
adaptor to be updated or from another MP 10.

FIG. 10 shows shared functions of the software configuration
of FIG. 9 based on the data flow.

The CHP 91 converts the address and data formats of the
higher-rank or host device into address and data formats of
the lower-rank device and writes them in the cache memory
20. Reference numeral 101 refers to a segment, 102 a block,
103 a stripe indicative of a data amount per magnetic disk
written in the array disk 5. The DMP 92 reads out a stripe
unit of data from the cache memory, converts a lower order
address of the data into row No., column No., FBA and block
number of the array disk, while the DSP 93 writes the data
in the array disk.

The DMP 92 also manages information as to configuration
of the array disk 5.

Since the respective control programs share the
respective functions as has been described above, when it is
desired to change the host interface to an SCSI or fiber
channel, this can be realized only by changing the control
program CHP 91. Further, when it is desired to change the
array disk configuration (disk row number/column number,
RAID (redundant array of inexpensive disks) system, etc.), this
can be realized only by changing the control program of the
DMP 92. When the interconnection of the host adaptors 1 and
disk adaptors 2 is modified and the respective control
programs are rewritten, scalability can be realized and the
burden on software development can be lightened.

FIGS. 11A and 11B are diagrams for explaining the
duplexed common bus 4 and degrade operation. Reference
numeral 111 denotes a bus master (the host adaptor 1 or disk
adaptor 2 having the MP 10) capable of acquiring an access
authority to the common bus 4, and numeral 112 denotes a
bus slave (cache memory package) for receiving an access
request from the bus master 111.

The F bus 26, in its usual operational state, realizes a
transmission rate of 400 MB/sec. based on 2 channels, each a
64-bit bus (200 MB/sec), and each bus channel can detect
a failure based on parity check or time-out. When a failure
takes place in one of the two channels, the bus master 111
is put in its degrade state so that the remaining one channel
is used to access the bus slave 112, and the degraded
operation information is then registered in the management
area on the shared memory 18.

System control signals (including a bus reset signal)
within the common bus are increased in reliability because
the signal line is arranged in a triplex system so that a 3-line
(channel) coincidence system is employed in the usual
operation and a 2-line (channel) coincidence (majority
decision) system is employed in the degrade operation.

FIG. 12 is a diagram for explaining the multiplexed
structures of the respective parts and degrade operation.

Reference numeral 121 denotes a channel bus having 2
ports. The host adaptor 1 is provided with 2 of the channel
adaptor modules 29 and 4 of the channel buses for the host
device, so that, at the time of a failure occurrence, an
alternate channel adaptor (CHP) and an alternate channel
bus are used to enter into the degrade operation.

Reference numeral 122 denotes a SCSI bus forming an
interface between the disk adaptor 2 and the array disk 5.
The SCSI bus 122 is duplexed so that another disk adaptor
2 can gain access to a row of magnetic disks, whereby, when
the bus becomes faulty, the alternate SCSI bus is used to
enter into the degrade operation. The DMP 92 for controlling
the array disk master specifies one of the DSPs 93 as the
alternate DMP 92, such that, at the time of a failure
occurrence, the alternate DMP 92 is used to control the array
disk master.

The shared memory 18 and the cache memory 20 are also
duplexed so that, when one of the duplexed shared memories
becomes faulty, the other memory is used to enter into the
degrade operation; while, when one of the duplexed cache
memories becomes faulty, write pending data (data remaining
in the cache memory) is destaged on the disk to perform
operation [illegible] of the other cache memory [illegible]
for the [illegible] cache memory.

When one of the magnetic disks in the array disk 5
becomes faulty, reading/writing operation is carried out
while [illegible] a spare magnetic disk [illegible] is replaced.

FIG. 13 is a diagram for explaining the multiplexed
structure of a power supply system and degrade operation.

Since the commercial power controllers 58 are respectively
duplexed on respectively independent AC input to
supply power to switching power supplies 55 of the logical
unit frame and to switching power supplies 57 of the array
disk respectively, at the time of a failure occurrence, the
other commercial power source controller 58 is used to enter
into the degrade operation.

Reference numeral 131 denotes a power supply control
circuit (which will be referred to as PCI, hereinafter) for
performing remote control of power ON/OFF from the host
device, and for controlling commercial power source controllers
58 and such power supply circuits for switching
power supplies.

When the switching power supplies 55 of the logical unit
frame are mounted by an additional number of 2, in addition
to the number necessary for the redundant operation to
supply power to logical unit frames 51 and batteries 52
through the power common bus, even if two of the switching
power supplies 55 become faulty, the system can be operated.
Likewise, when the switching power supplies 57 of the
array disk for supplying power to row units of a group of
magnetic disks are mounted by an additional number of 2,
in addition to the number necessary for the redundant
operation to supply power through the power common bus,
even if two of the switching power supplies 57 become
faulty, the system can be operated. In addition, the system
can be rendered more inexpensive when both switching
power supplies 55 and 57 are structured in a duplex
configuration.

In case of a power failure, power is supplied from the
duplexed battery 52 via the power common bus to the cache
memory as a volatile memory within the logical unit frame
and to the PCI 131, so that, even when one of the batteries
becomes faulty, the system can be operated.

FIGS. 14 and 15 are a table and a graph showing
comparative system performances when the different array
disks are made up of different storage capacities of magnetic
disk units.
More specifically, FIG. 14 shows the structures of array
disks having the same capacity but using different types of
magnetic disk units.

With regard to the array structure, 14 data disks and 2
parity disks form a set. In the case of an item number 141,
a 3 GB magnetic disk unit (3.5-inch disk) is used and 5 sets
of array structures are provided. In the case of an item
number 142, a 4 GB magnetic disk unit (5-inch disk) is used
and 4 sets of array configurations are provided. In the case
of an item number 143, an 8.4 GB magnetic disk unit
(6.4-inch disk) is used and 2 sets of array structures are
provided.

FIG. 15 shows a relationship between I/O instruction
issuance number per second and an average response time
with respect to the magnetic disk units 141, 142 and 143. In
order to improve the transaction performance of the array
disk system, when a small-capacity (small-diametered) magnetic
disk unit is used to increase the array configuration, the
highest performance can be exhibited. For this reason, in
accordance with the present invention, the 3.5-inch magnetic
disk unit 141 is used to realize an array disk system.
Accordingly, when comparison is made between a magnetic
disk unit having the same storage capacity and made up of
a single large-scale magnetic disk unit and a magnetic disk
unit having the same storage capacity but made up of a
plurality of small-size magnetic disk units arranged in an
array, the latter array structure having the plurality of
small-size magnetic disk units is more advantageous
because its average access time can be shortened.

Shown in FIGS. 16 to 19 are model examples of the
structure of the system implemented with use of such a
scalable architecture as explained above.

More in detail, FIG. 16 shows an arrangement when the
number of the disk adaptors 2 mounted on the common bus
4 is decreased and further the cache port packages 22 are
mounted to be connected to the cache units 24 via the cable
23 to realize a small-size disk array having high-performance,
large-capacity cache memories providing a
high cache hit rate.

When the disk adaptors 2 are not mounted and the system
is arranged only with the host adaptor 1 and cache memories
(such an arrangement as enclosed by a broken line in FIG.
16), the magnetic disks as the storage medium are replaced
by semiconductor memories and there is realized a high-performance
semiconductor disk system allowing higher-speed
data transmission.

FIG. 17 shows an arrangement when a maximum number
of the disk adaptors 2 are provided, the cache packages 3 or
the cache port packages 22 are provided to be connected to
the cache units through the cable 23 to thereby realize a
large-scale disk array system having high-performance,
large-capacity cache memories.

FIG. 18 shows an arrangement when the host device
interfaces of the host adaptors 1 are replaced by such
interfaces as SCSI/fiber channels to reduce the mounting
number of the disk adaptors 2 and further the F bus 26 is
made up of 2 channels having a bit width corresponding to
half of the bit width of the F bus to thereby realize a non-stop
operation, high-performance fault tolerant (highly reliable)
server system designed for an open market.

FIG. 19 shows a simplest arrangement of FIG. 18 when no
consideration is paid to duplexed structure and hot replace to
thereby realize a server system designed for an inexpensive
open market. In the drawing, reference symbol 4D+1P
means 4 data disks and a single parity disk.

In the foregoing embodiments, optical disk units can be
connected to the common bus 4 through optical disk adaptors
(optical disk connection logical modules), magnetic
tape units can be connected thereto through magnetic tape
controllers (magnetic disk connection logical modules), or
semiconductor memories can be connected thereto through
semiconductor memory connection logical modules.
Further, workstations can be also connected to the common
bus 4 through another type of host adaptors. In this way,
storage medium adaptors to various types of memories can
be connected to the common bus.

As has been explained in the foregoing, in accordance
with the present invention, there is provided a storage
system which comprises a plurality of logical units connected
to a host device to form interfaces with the host
device, a storage unit, a plurality of logical units connected
to the memories to form interfaces with the storage unit, and
cache memories (shared with the plurality of logical units
connected to the host device and with the plurality of logical
units connected to the storage unit) for temporarily storing
therein data to be transferred between these devices and
units, and wherein the plurality of logical units connected to
the host device, the plurality of logical units connected to the
storage unit, and the cache memories are interconnected by
a common bus shared with these devices and units. As a
result, there can be obtained a scalable system which realizes
extension and modification of the plurality of logical units
connected to the host device, the plurality of logical units
connected to the memories, and the cache memories merely
by adding or modifying these on the common bus, and which
realizes easy attainment of its upgrade based on the extended
system provision. Further, since these logical units connected
to the host device, logical units connected to the
memories and cache memories are made in the form of
modules to be detachably mounted to a back plane having
the common bus disposed thereon, the necessary numbers of
these units and memories can be easily increased advantageously.

Since the logical units connected to the host device, the
logical units connected to the memories, and the cache
memories are arranged in a duplexed form and the common
bus is wired between these logical units and memory as
divided into 2 channels, even when one of these units
becomes faulty, the other unit can be used to perform
degrade operation. In this case, since any of the logical units
connected to the host device, the logical units connected to
the memories, and the cache memories are provided with a
connector allowing hot replace, the system advantageously
allows its maintenance and inspection for exchange of a
faulty part and also allows addition of parts for system
extension while eliminating the need for halt of the system
in operation.

Further, since the storage unit is made in an array form of
a plurality of combined small memories, the memory array
can advantageously shorten an access time when compared
to that in the case of using the conventional single large-scale
disk unit.

In addition, since the cache memory unit is made up of
cache memory modules (cache memory packages) directly
mounted to the common bus and the extending cache units
and a necessary number of the extending cache units can be
connected through the extending cache port packages
directly mounted detachably to the common busses, the
number of cache units can be easily increased or decreased
advantageously.

As a result, there can be obtained a highly reliable storage
system.
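As an illustration of the degrade operation of the two-channel common bus summarized above, the following is a minimal sketch in Python. It assumes only the figures stated in the description (two channels of 200 MB/sec each, failure detection by parity check or time-out); the class name `FBus` and its methods are hypothetical names introduced here for illustration and do not appear in the specification.

```python
from dataclasses import dataclass, field

# Per-channel rate of the 64-bit F bus channel, as stated in the description.
CHANNEL_RATE_MB_S = 200


@dataclass
class FBus:
    """Toy model of the two-channel F bus (hypothetical, for illustration only)."""

    # One health flag per channel; both channels start in service.
    healthy: list = field(default_factory=lambda: [True, True])

    def report_failure(self, channel: int) -> None:
        # A parity error or time-out detected on a channel takes it out of service.
        self.healthy[channel] = False

    @property
    def degraded(self) -> bool:
        # The bus is in its degrade state as soon as any channel is out of service.
        return not all(self.healthy)

    def throughput_mb_s(self) -> int:
        # The aggregate rate is the sum over the channels still in service.
        return CHANNEL_RATE_MB_S * sum(self.healthy)


bus = FBus()
normal_rate = bus.throughput_mb_s()    # both channels in use
bus.report_failure(0)                  # channel 0 fails
degraded_rate = bus.throughput_mb_s()  # remaining channel carries all traffic
```

In this sketch the bus keeps operating after a single channel failure at half the usual rate, which corresponds to the degrade operation described for the bus master 111 and bus slave 112. Registration of the degraded state in the shared memory 18 is not modeled.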
What is claimed is:

1. A storage system comprising:
a plurality of first logical units, connected to at least one
host device, which form interfaces for the host device;
a storage device for storing therein information received
from the host device;
a plurality of second logical units, connected to said
storage device, which form interfaces for said storage
device;
a duplexed cache memory device for temporarily storing
therein data transferred between said plurality of first
logical units and said plurality of second logical units;
and
a common bus connected to said plurality of first logical
units, said plurality of second logical units, and said
duplexed cache memory device and having a two-channel
data transfer bus which operates as a pair of
buses for transferring data among said plurality of first
logical units, said plurality of second logical units, and
said cache memory device, wherein each bus in said
pair of buses is adapted to transfer different data.

2. A storage system as set forth in claim 1, wherein said
plurality of first logical units, said plurality of second logical
units, and said cache memory device are formed as modules,
each of which is detachably mounted to said common bus.

3. A storage system as set forth in claim 1, wherein said
common bus is disposed on a back plane, and said plurality
of first logical units, said plurality of second logical units,
and said cache memory device are formed as modules, each
of which is detachably mounted to said back plane.

4. A storage system as set forth in claim 1, wherein said
plurality of first logical units includes first logical units
forming different interfaces for the host device.

5. A storage system as set forth in claim 1, wherein said
storage device has a plurality of small-size storage units,
said small-size storage units arranged in a horizontal direction
and forming an error correction circuit group, said
group being arranged in a vertical direction.

6. The storage system recited in claim 1, wherein during
a fault condition of one channel of said two channel bus, the
other channel transfers data for both of said channels.

7. The storage system recited in claim 1, wherein each of
said plurality of second logical units are connected to said
storage device by at least two paths, so that if a fault
condition occurs on one of said paths, the other of said paths
conveys data between said each of said plurality of second
logical units and said storage device.

8. A storage system comprising:
a plurality of multiplexed first logical units, connected to
a host device, which form interfaces for the host device;
a storage device for storing therein information received
from said host device;
a plurality of multiplexed second logical units, connected
to said storage device, which form interfaces for said
storage device;
a duplexed cache memory device for temporarily storing
therein data transferred between said plurality of second
logical units and said plurality of first logical units;
and
a common bus connected to said plurality of first logical
units, said plurality of second logical units, and said
cache memory device and having a two-channel, high-speed
input/output bus operating as a pair of buses for
transferring data among said plurality of first logical
units, said plurality of second logical units, and said
cache memory device, wherein each bus in said pair of
buses is adapted to transfer different data.

9. A storage system as set forth in claim 8, wherein said
plurality of first logical units, said plurality of second logical
units, and said cache memory device are formed as modules,
each of which is detachably mounted to said common bus.

10. A storage system as set forth in claim 8, wherein said
common bus is disposed on a back plane, and said plurality
of first logical units, said plurality of second logical units,
and said cache memory device are formed as modules, each
of which is detachably mounted to said back plane.

11. A storage system as set forth in claim 8, wherein said
storage device has a plurality of small-size storage units,
said small-size storage units arranged in a horizontal direction
forming an error correction circuit group, said group
being arranged in a vertical direction.

12. A storage system as set forth in claim 8, wherein said
host device is connected to said common bus through two or
more of said plurality of first logical units.

13. A storage system as set forth in claim 8, wherein said
storage device is connected to said common bus through two
or more of said plurality of second logical units.

14. A storage system as set forth in claim 8, wherein said
plurality of first logical units and said plurality of second
logical units each have a duplexed microprocessor and a
checker for performing comparison in operation between
said duplexed microprocessor.

15. A storage system as set forth in claim 8, wherein said
common bus has a high-speed I/O bus divided into two
channels for data transmission and a single-channel multiprocessor
bus for transmission of control information for
said data transmission.

16. A storage system as set forth in claim 8, wherein said
common bus is a two-channel, high-speed input/output bus,
and wherein, when one channel of said common bus
becomes faulty, the other channel is used to continue operation.

17. A storage system as set forth in claim 8, wherein said
common bus further includes a one-channel multiprocessor
bus for transferring control information used for the data
transfer, and wherein, when said multi-processor bus
becomes faulty, one of said two channels in said high-speed
input/output bus is used as the multi-processor bus and the
other of said two channels of said high-speed bus is used as
said high-speed input/output bus.

18. The storage system recited in claim 8, wherein during
a fault condition of one channel of said two channel bus, the
other channel transfers data for both of said channels.

19. The storage system recited in claim 8, wherein each of
said plurality of second logical units are connected to said
storage device by at least two paths, so that if a fault
condition occurs on one of said paths, the other of said paths
conveys data between said each of said plurality of second
logical units and said storage device.

20. A storage system comprising:
a plurality of duplexed first logical units, connected to a
host device, which operate mutually independently
from one another, each of said plurality of first logical
units including a processor which, upon detecting a
faulty condition of another of said plurality of first
logical units, continues to perform processing of said
another first logical unit;
a storage device for storing therein information received
from said host device;
a plurality of duplexed second logical units, connected to
said storage device, which operate mutually independently
from one another, each of said plurality of
second logical units including a processor which, upon
detecting a faulty condition of another of said plurality 
of second logical units, continues to perform process- 
ing of said another second logical unit; 
a duplexed cache memory device for temporarily storing 
therein data transferred between said plurality of first 
logical units and said plurality of second logical units; 
and 

a common bus having a two-channel data transfer bus 
operated as a pair of buses for mutually connecting said 
plurality of first logical units, said plurality of second 
logical units, and said cache memory device, wherein 
each bus in said pair of buses is adapted to transfer 
different data, 

wherein said plurality of first logical units, said plurality 
of second logical units, and said cache memory device 
allow hot replacement to occur with respect to said 
common bus. 

21. A storage system as set forth in claim 20, wherein said 
plurality of first logical units, said plurality of second logical 
units, and said cache memory device are formed as modules, 
each of which is detachably mounted to said common bus. 

22. A storage system as set forth in claim 20, wherein said 
common bus is disposed on a back plane, and said first 
logical units, second logical units and said cache memory 
device in the form of modules are detachably mounted to 
said back plane. 

23. A storage system as set forth in claim 20, wherein said 
storage device has a plurality of small-size storage units, 
said small-size storage units arranged in a horizontal direction
forming an error correction circuit group, said group
being arranged in a vertical direction.

24. A storage system as set forth in claim 20, further 
comprising: 

a power bus, a multiplexed power source part for supplying
power to said power bus, and a duplexed power
supply part for supplying power to said multiplexed
power source part, said power bus being connected
with said plurality of first logical units, said plurality of
second logical units, said cache memory device, and
said storage device, and wherein a number of power
sources in said power source part corresponds to a
number necessary to power said plurality of first logical
units, said plurality of second logical units, said cache
memory device, and said storage device plus one.

25. The storage system recited in claim 20, wherein said 
faulty condition being detected is one selected from the 
group including a need to replace a hardware component of 
the system and a need to upgrade system software. 

26. The storage system recited in claim 20, wherein
during a fault condition of one channel of said two channel
bus, the other channel transfers data for both of said channels.

27. The storage system recited in claim 20, wherein each 
of said plurality of second logical units are connected to said
storage device by at least two paths, so that if a fault 
condition occurs on one of said paths, the other of said paths 
conveys data between said each of said plurality of second 
logical units and said storage device. 

* * * * * 